Running head: USING DECISION TREES TO PREDICT CRIME REPORTING
نویسندگان
چکیده
Crime reports are used to find criminals, prevent further violations, identify problems causing crimes and allocate government resources. Unfortunately, many crimes go unreported. The National Crime Victimization Survey (NCVS) comprises data about incidents, victims, suspects and if the incident was reported or not. Current research using the NCVS is limited to statistical techniques resulting in a limited ‘view’ of the data. Our goal is to use decision trees to predict when crime is reported or not. We compare decision trees that are built based on domain knowledge with those created with three variable selection methods. We conclude that using decision trees leads to the discovery of several new variables to research further. Using Decision Trees to Predict Crime Reporting 3 Introduction The financial loss due to violent and personal crimes in 2004 was $15.85 billion (Sedgwick, 2006) and 57.5% of these crimes were not reported to the police (BJS, 2005). Other costs of unreported crimes include counseling costs, alarms, electronic surveillance equipment and indirect costs such as insurance and taxes (Sedgwick, 2006). An ongoing nationwide survey has been in use since 1973 in order to better understand both reported and unreported crimes. The National Crime Victimization Survey (NCVS) is used to gather data on injury, theft, damage, the amount of lost work and other characteristics of the incident, victim and suspect. One of the goals of the NCVS is to understand the quantity of crimes and crime types that are not reported to the police (BJS, 2005). Each year, 45,000 households are interviewed about past incidents where they were the victim and the NCVS is the main source of data on the characteristics of criminal victimizations (NACJD, 2006). In addition, it also describes crime types not reported to law enforcement and the characteristics of violent offenders (NACJD, 2006). The NCVS classifies each incident as a personal or property crime. Personal crimes include rape, sexual attack, robbery, assault and purse snatching. Property crimes include burglary, theft and vandalism. For example in 2005, 51% of personal crimes and 59% of property crimes were not reported (BJS, 2006a). Table 1 shows the large number of personal crimes, by crime type, in 2005 and whether or not they were reported. There were a significant percentage of crimes that are not reported. Using Decision Trees to Predict Crime Reporting 4 Table 1. Number of victimizations, by crime type and whether or not reported (BJS, 2005)
منابع مشابه
Predicting Crime Reporting with Decision Trees and the National Crime Victimization Survey
Crime reports are used by law enforcement to find criminals, prevent further violations, identify problems causing crimes and allocate government resources. Unfortunately, many crimes go unreported. This may lead to an incorrect crime picture and suboptimal responses to the existing situation. Our goal is to use a data mining approach to increase understanding of when crime is reported or not. ...
متن کاملExtraction of Drug Crime Patterns and Identifying People at Risk Using Data Mining Techniques
Introduction: In recent years, technology advancement and the growth of information technology in organizations have provided a huge source of data stored in the field of drug-related offenses. Analyzing these data and discovering hidden patterns in it can help detect and prevent the occurrence of crimes in this area. This paper aimed to identify the susceptible people to drug trafficking in Si...
متن کاملExtraction of Drug Crime Patterns and Identifying People at Risk Using Data Mining Techniques
Introduction: In recent years, technology advancement and the growth of information technology in organizations have provided a huge source of data stored in the field of drug-related offenses. Analyzing these data and discovering hidden patterns in it can help detect and prevent the occurrence of crimes in this area. This paper aimed to identify the susceptible people to drug trafficking in Si...
متن کاملمطالعات درخت تصمیم در برآورد ریسک ابتلا به سرطان سینه با استفاده از چند شکلیهای تک نوکلوئیدی
Abstract Introduction: Decision tree is the data mining tools to collect, accurate prediction and sift information from massive amounts of data that are used widely in the field of computational biology and bioinformatics. In bioinformatics can be predict on diseases, including breast cancer. The use of genomic data including single nucleotide polymorphisms is a very important ...
متن کاملPredicting The Type of Malaria Using Classification and Regression Decision Trees
Predicting The Type of Malaria Using Classification and Regression Decision Trees Maryam Ashoori1 *, Fatemeh Hamzavi2 1School of Technical and Engineering, Higher Educational Complex of Saravan, Saravan, Iran 2School of Agriculture, Higher Educational Complex of Saravan, Saravan, Iran Abstract Background: Malaria is an infectious disease infecting 200 - 300 million people annually. Environme...
متن کامل